Ad Hoc Retrieval Experiments Using WordNet and Automatically Constructed Thesauri

Authors

  • Rila Mandala
  • Takenobu Tokunaga
  • Hozumi Tanaka
  • Akitoshi Okumura
  • Kenji Satoh
Abstract

This paper describes our method for the automatic ad hoc task of TREC-7. We propose a method to improve the performance of an information retrieval system by expanding the query using three different types of thesaurus. The expansion terms are taken from a handcrafted thesaurus (WordNet), a co-occurrence-based automatically constructed thesaurus, and a syntactic predicate-argument-based automatically constructed thesaurus.
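As a rough illustration of query expansion with several thesauri, the following minimal sketch scores each candidate expansion term by its similarity to the query terms, averaged over the thesauri. Everything in it is an assumption made for illustration: the toy thesaurus entries, the similarity values, the averaging rule, and the names `expand_query` and `top_k` are hypothetical and are not taken from the abstract, which does not specify how expansion terms are weighted or selected.

```python
from collections import defaultdict

# Toy thesauri (hypothetical data): term -> {related term: similarity in [0, 1]}.
HANDCRAFTED = {"car": {"automobile": 1.0, "vehicle": 0.5}}
COOCCURRENCE = {"car": {"engine": 0.5, "automobile": 0.5}, "rental": {"lease": 0.5}}
PREDICATE_ARGUMENT = {"car": {"truck": 0.25}, "rental": {"lease": 1.0}}


def expand_query(query_terms, thesauri, top_k=3):
    """Append the top_k best-scoring related terms to the query.

    A candidate's score is its similarity to each query term, summed over
    query terms and averaged over thesauri (an assumed weighting scheme).
    """
    scores = defaultdict(float)
    for thesaurus in thesauri:
        for term in query_terms:
            for related, similarity in thesaurus.get(term, {}).items():
                if related not in query_terms:
                    scores[related] += similarity / len(thesauri)
    # Sort by descending score, breaking ties alphabetically for determinism.
    ranked = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    return list(query_terms) + [term for term, _ in ranked[:top_k]]


expanded = expand_query(
    ["car", "rental"], [HANDCRAFTED, COOCCURRENCE, PREDICATE_ARGUMENT]
)
print(expanded)
```

Averaging over thesauri means that a term supported by more than one source, such as "automobile" or "lease" in the toy data, tends to outrank a term that appears in only one.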

Similar resources

Combining General Hand-Made and Automatically Constructed Thesauri for Query Expansion in Information Retrieval

One of the most intuitive ideas for enhancing the effectiveness of an information retrieval system is to use a thesaurus. WordNet, as a hand-crafted, general-purpose thesaurus, should intuitively also work well in information retrieval, but unfortunately, experimental results by many researchers have not been promising. Therefore, in this paper we investigate why the use of WordN...

An Association Thesaurus for Information Retrieval

Although commonly used in both commercial and experimental information retrieval systems, thesauri have not demonstrated consistent benefits for retrieval performance, and it is difficult to construct a thesaurus automatically for large text databases. In this paper, an approach, called PhraseFinder, is proposed to construct collection-dependent association thesauri automatically using large full-...

Complementing WordNet with Roget's and Corpus-based Thesauri for Information Retrieval

This paper proposes a method to overcome the drawbacks of WordNet when applied to information retrieval by complementing it with Roget's thesaurus and corpus-derived thesauri. Words and relations that are not included in WordNet can be found in the corpus-derived thesauri. The effects of polysemy can be minimized with a weighting method that considers all query terms and all of the thesauri. Experimen...

A Two-Stage Retrieval Model for the TREC-7 Ad Hoc Task

A two-stage model for ad hoc text retrieval is proposed in which recall and precision are maximized sequentially. The first stage employs query expansion methods using WordNet and on a modified stemming algorithm. The second stage incorporates a term proximity-based scoring function and a prototype-based reranking method. The effectiveness of the two-stage retrieval model is tested on the TREC-7 ad...

Focused Search in Books and Wikipedia: Categories, Links and Relevance Feedback

In this paper we describe our participation in INEX 2009 in the Ad Hoc Track, the Book Track, and the Entity Ranking Track. In the Ad Hoc track we investigate focused link evidence, using only links from retrieved sections. The new collection is not only annotated with Wikipedia categories, but also with YAGO/WordNet categories. We explore how we can use both types of category information, in t...

Publication date: 1998